Unsupervised acoustic model adaptation based on phoneme error minimization
نویسندگان
چکیده
In this paper, a new decoding method for unsupervised acoustic model adaptation is presented. In unsupervised adaptation framework, the effectiveness of adaptation process is greatly affected by the mis-recognized labels. Therefore, selection of the adaptation data guided by the confidence measures is effective in unsupervised adaptation. We propose phoneme error minimization framework for exact phoneme labels and use of phoneme-level confidence measures for improved unsupervised adaptation. Experimental results showed that the proposed method could reduce the mis-recognized labels in the adaptation process, and consequently improved the adaptation accuracy. Furthermore, it was confirmed that the proposed method is effective in an iterative unsupervised adaptation framework.
منابع مشابه
Unsupervised Acoustic Model Adaptati Minimizatio
In this paper, a new decoding method for unsupervised acoustic model adaptation is presented. In unsupervised adaptation framework, the effectiveness of adaptation process is greatly affected by the mis-recognized labels. Therefore, selection of the adaptation data guided by the confidence measures is effective in unsupervised adaptation. We propose phoneme error minimization framework for exac...
متن کاملSpeech Data Clustering Based on Phoneme Error Trend for Unsupervised Acoustic Model Adaptation
Unsupervised cluster adaptive training of acoustic models offers promise in improving recognition accuracy, especially for speech recognition systems that store massive sets of speech samples from unknown people. How to classify the variety of acoustic characteristics is an important problem in adaptation sample clustering. We propose a novel speech sample clustering method that focuses on the ...
متن کاملRapid unsupervised adaptation using frame independent output probabilities of gender and context independent phoneme models
Business is demanding higher recognition accuracy with no increase in computation time compared to previously adopted baseline speech recognition systems. Accuracy can be improved by adding a gender dependent acoustic model and unsupervised adaptation based on CMLLR (Constrained Maximum Likelihood Linear Regression). CMLLR-based batch-type unsupervised adaptation estimates a single global trans...
متن کاملUnsupervised adaptation for acoustic language identification
Our system for automatic language identification (LID) of spoken utterances is performed with language dependent parallel phoneme recognition (PPR) using Hidden Markov Model (HMM) phoneme recognizers and optional phoneme language models (LMs). Such a LID system for continuous speech requires many hours of orthographically transcribed data for training of language dependent HMMs and LMs as well ...
متن کاملAn Empirical Study of Word Error Minimization Approaches for Mandarin Large Vocabulary Continuous Speech Recognition
This paper presents an empirical study of word error minimization approaches for Mandarin large vocabulary continuous speech recognition (LVCSR). First, the minimum phone error (MPE) criterion, which is one of the most popular discriminative training criteria, is extensively investigated for both acoustic model training and adaptation in a Mandarin LVCSR system. Second, the word error minimizat...
متن کامل